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Let Xi,X2 be independent random walks on ZjJ, d > 3, each 
starting from the uniform distribution. Initially, each site of Zfj is 
unmarked and, whenever Xi visits such a site, it is set irreversibly to 
i. The mean of \Ai\, the cardinality of the set Ai of sites painted by 
i once all of ZjJ has been visited, is n''/2 by symmetry. We prove the 
following conjecture due to Pemantle and Peres: for each d > 3 there 
exists a constant Ud such that lim„_>cx> Ya,r{\Ai\) / hd{n) = \ad where 
h'i{n) — n*, hiiji) — n*(logn), and hd{n) = n'^ for d> 5. We will also 
identify ad explicitly. This is a special case of a more general theorem 
which gives the asymptotics of Var(|^i|) for a large class of transient, 
vertex transitive graphs; other examples include the hypercube and 
the Caley graph of the symmetric group generated by transpositions. 

1. Introduction. Suppose that Xi,X2 are independent random walks 
on a graph G = {V, E) starting from stationarity. Initiahy, each vertex of G 
is unmarked and, whenever Xi visits such a site, it is marked i irreversibly. 
If both Xi and X2 visit a site for the first time simultaneously, then the 
mark is chosen by the flip of an independent fair coin. Let Ai be the set 
of sites marked i once every vertex of G has been visited. By symmetry, it 
is obvious that E|^j| = \\V\. The purpose of this manuscript is to derive 
precise asymptotics for Var(|^j|) for many families of graphs. 

The study of the statistical properties of Ai was first proposed in the work 
of S.R. Gomes Junior et. al. in [ ] as a source of interesting mathematical 
problems which serve as models for the technical challenges associated with 
physical problems involving interacting random walks. Their main result is 
an estimate of the growth of E|S| where B is the interface separating ^1 
from A2 in the special case of the one-cycle Z^. As with E|^j|, computing 
E|i3| for becomes trivial starting with d = 3 since it is easy to see that, 
with probability strictly between and 1, for any pair of adjacent vertices 
x^y, X2 will hit y before Xi conditional on the event that Xi hits x first. 
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On the other hand, estimating Var(|^j|) in this setting poses a much greater 
challenge since its expansion in terms of correlation functions exhibits sig- 
nificant cancellation which, when ignored, leads to bounds that are quite 
imprecise. We will develop this point further at the end of the introduction. 

We remark that the problem we consider was communicated to us by 
Hilhorst [ ] in a slightly different setting. Rather than considering the sets 
of sites Ai , A2 first painted by Xi, X2, respectively, it is also natural to 
study the sets Ai,A2 of sites most recently painted by Xi,X2, respectively. 
In other words, in the latter formulation the constraint that the marks are 
irreversible is removed. It turns out that these two classes of problems are 

equivalent, which is to say (^1,^2) = {Ai,A2)- This helpful observation, 
which follows from the time-reversibility of random walk, was made and 
communicated to us by Comets [2]. 

We restrict our attention to lazy walks Xi , X2 to avoid issues of period- 
icity. That is, the one-step transition kernel is given by 



where x ^ y means that x is adjacent to y in G. The particular choice of 
holding probability ^ is not important for the proof; indeed, any A G (0, 1) 
would suffice. Our proofs also work in the setting of continuous time walks. 
Let •; G) be the i-step transition kernel of lazy random walk on G and 
7r(-;G) its unique stationary distribution. 

Our main result is the precise asymptotics for Var(|^i|) on tori of dimen- 
sion at least three, thus verifying a conjecture due to Pemantle and Peres 



Theorem 1.1. Suppose that Gn = Z^, d > 3. There exists a finite 
constant > such that 



p{x,y;G) 



= < 



'I iix = y, 

n,,^ / \ if X ~ V 



^0 otherwise. 



[11]. 



lim 



Var(IAI) ^ 1 
hd{n) 4 



where 



if d = 3, 
< n'^(logn) if d = 4, and 
^n'^ if d > 5. 
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Our proof allows us to identify ad explicitly and is given as follows. Let 



(1-1) Z^) = Eo ^ l{x(t)=x} 

t=o 

be the Green's function for lazy random walk on Z'^. For d > 5, 

It is not difficult to show that ^ 1 as (i — t- oo, so that Var(|^i|) ~ |n'^ 
for d and n large is close to the variance of an iid marking. For d = 4, 

(1.3) a,= hm \ Yl G\y;Z% 

we will explain why this limit exists and is positive and finite in Proposition 
2.1. The definition of 03 is slightly more involved. Let denote the three- 
dimensional continuum torus, p*(-, •; T^) the kernel for Brownian motion on 
T^, and 

T 



(7^(x,y;T3) ^ / ^t[^^y.^^^di. 
Jo 



Now set 



(1-4) = 7^27^^ f f (//'(^,y;T=') - \T?dxdy, 03 = lim a 
G^(U; Z'^j Jt» JT^ T^ca 



T 
3 • 



The reason that the limit exists and is positive and finite is that p^{x, y; T^) 
converges to the uniform density exponentially fast in t (see Proposition 3.1 
for a discrete version of this statement). 

One interesting remark is that the variance for d = 3,4 is significantly 
higher than that for an iid marking. The results of Theorem 1.1 should also 
be contrasted with behavior of the variance of the range IZ of random walk 
on Z'^ run up to the cover time Tcovi'^n) °f ^n' which is the expected amount 
of time it takes for a single random walk to visit every site. When d > 3, 
Tcov(Z^) ~ Crfn°'(logn) (see [10]). For (i > 5, it follows from work of Jain 
and Orey [6] that Var(|7^|) = 0(n'^(log n)). For d = 3, 4, it follows from 
work on Jain and Pruitt [ ] that Var(|7^|) is 0(n^(logn)^) and 0(n^(logn)), 
respectively. 

This work opens the doors to many other problems involving two random 
walks. Natural next steps include CLTs for the fluctuations of \Ai\ and for 
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the number of sites painted by i at time t, as well as the development of an 
understanding of the geometrical properties of the clusters of Ai- The latter 
seem to be connected to the theory of interlacements, which arise in the 
process of disconnection of a discrete cylinder by a random walk f"^], [12], 
[13]. 

Theorem 1.1 is a special case of a much more general result, which gives 
the asymptotics of Var(|^j|) for many other graphs, such as the hypercube 
and the Caley graph of the symmetric group generated by transpositions. 
We will now review some additional terminology which is necessary to give 
a precise statement of the result. Recall that the uniform mixing time is 

Tmix{G) = mill \t>0: max 
and the Green's function for G is 

9(.x,y;G)= ^ p\x,y;G), 
t=o 

i.e. the expected amount of time Xi spends at y up until Tmix(G) when 
started from x. Let Tj(x) = min{t > : Xi{t) = x}; we will omit i if there is 
only one random walk under consideration. 

Assumption 1.2. (Gn) is a sequence of vertex transitive graphs with 
\Vn\ ^ oo such that 

1. rmix(G„) = o(|K|/(log |K|)^) and lim.„^oo T^iAGn) = oo, 

2- Ey^.9^(.X,y;Gn) = 0{TrnUGn)/l0g\Vn\), and 

3. there exists po < 1 so that P^lTiy) f\T[z) < Tmix(G„)] < po uniformly 
in n and x,y,z £ Vn distinct. 

Note that vertex transitivity implies that g is constant along the diagonal 
and, combined with part (3), implies the existence of go > such that 
g{x, X] Gn) < go uniformly in x and n. 

The general theorem is: 

Theorem 1.3. Suppose that (Gn) satisfies Assumption 1.2. Let fn,cix, y) = 
Px[T{y) < crmix(Gn)], 7n,c = E?/ /n,c(a;, y)vr(y; G„), and 

Fn,c = ^{fn,c{x,y) - 7n,c?- 

x,y 



P ix,y;G) 
niy;G) 



< 
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There exists 7 > so that for every c>2 we have 



(1.5) 



Var(IAI) = U + 0(A„)j F„,, + 0(e-^'=(T^i,(G„))2) 



as n 00 where A„ = (rmix(G'„) log |y„|)/|T4| . 

Applying this to the special cases of the hypercube and the Caley graph 
of Sn generated by transpositions leads to the following corollary. 

Corollary 1.4. Suppose that Gn = {Vn,En) is either the hypercube 
or the Caley graph of Sn generated by transpositions. Then 



In particular, the first-order asymptotics of the variance are exactly the 
same as for an iid marking. 

Throughout the remainder of the article, all graphs under consideration 
shall satisfy Assumption 1.2. In most examples, it will be that T^-^^{Gn) = 
o{Fn^c) so that the second term in (1.5) is negligible. In this case, taking c = 2 
in (1.5) provides a means to compute not only the magnitude but also the 
constant in the first order asymptotics of the variance. In some cases, such as 
Gn = 'Zin, the constant can even be computed when Fn^c = 0((T'mix(G'„))^). 

The challenge in obtaining Theorems 1.1 and 1.3 is that the cancellation 
in the expansion of the variance is quite significant which, when ignored, 
yields only an upper bound that can be off by as much as a multiple of 
TmixiGn)- We will now illustrate this point in the case of for d > 3. It 
will turn out that the contribution to the variance from the sites visited by 
both Xi,X2 simultaneously is negligible, hence we will ignore this possibility 
in the present discussion. Up to negligible error. 



Here, Px,z denotes the joint law of Xi,X2 with -'^i(O) = x and ^2(0) = z 
and TT{-;x,y) is the law of X2(ti(x)) conditional on the event H{x,y) = 
{ti{x) < Ti{y) , T2{x) , T2{y)} . The reason we do not have equality is that 



Var(|A|) = i(l + o(l))|K 




is equal to 
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P[H{x,y)] / J since it could be that Xi,X2 hit either x or y at the same 
time. We will, however, argue in Lemma 4.3 that replacing 'P[H{x,y)] by 
1/4 introduces negligible error. Thus we need to estimate 

(1-6) I ('P-Aniy) < r2{y)] - ^) ^z;x,y). 

x,y,z ^ ^ 

At this point, one is tempted to insert absolute values and then work on 
each of the summands separately. An application of Bayes' rule shows that 
there exists Co > so that 7r{z;x,y) < Cq'it{z]7j'^) (see Theorem 4.1 for a 
much finer estimate), hence the expression in (1.6) is bounded from above 
by 



(1.7) Ci 



x,y 



PxAMy) < My)] - I 



where Px,n denotes the law of Xi,X2 with Xi{0) = x and ^2(6) ~ vr. We 
can analyze Fx,TT[Ti{y) < T2iy)] as follows. First, the probability that X2 
hits y before tc = cn? is of order 'n?~'^ by a union bound. Second, by the 
transience of random walk, the probability that Xi hits y before tc is, up to 
a multiplicative constant, well approximated by g{x,y; Z^). By time tc, Xi 
will have mixed, which means that if neither Xi nor X2 has hit y by this 
time, then the probability that either one hits first is close to 1 /2. The careful 
reader who wishes to see precise, quantitative versions of these statements 
will find such in the lemmas we use to prove Theorem 1.3. Thus it is not 
difficult to see that there exists C2 > so that \Px,TT[Ti{y) < T2{y)] — 1/2| < 
C2g{x,y; Z^), which leads to an upper bound of 



C3^5(x,2/; Zf)<C4n^+2 
x,y 



A slightly more refined analysis leads to a lower bound of (1.7) with the same 
growth rate. As we will show in the next section, in every dimension this 
estimate is typically quite far from being sharp. The reason that it is so poor 
is that by moving the absolute value into the sum in (1.7) we are unable to 
take advantage of the cancellation that arises as Px,n[Ti{y) < T2(y)] > 1/2 
when X is close to y and Px,n[Ti{y) < ^2(2/)] < 1/2 when x is far from y. 

Outline. The remainder of this article is structured as follows. In the 
next section, we will deduce Theorem 1.1 and Corollary 1.4 from Theorem 
1.3. In Section 3, we introduce some notation that will be used throughout in 
addition to collecting several basic random walk estimates. Next, in Section 
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4, we give a precise estimate of the Radon-Nikodym derivative of 7f(-;x,y) 
with respect to vr. In Section 5, we prove Theorem 1.3 and end in Section 6 
with a list of related problems and discussion. 

2. Proof of of Theorem 1.1 and Corollary 1.4. The following 
Proposition will be important for the proof of Theorem 1.1. 

Proposition 2.1. For each c > 1, the limit 

^iP l~n{ y](/",c(2;, y) - ln,cf 

n->oo hd(n) ^ 

exists. When d > 4, it is ad as in (1.2), (1.3). When d = 2>, it is given by 
a§ where is as in (1.4). 

The first step in the proof of the proposition is to reduce the existence 
of the limit to a computation involving Green's functions. Recall from (1.1) 
that G{y; TJ^) is the Green's function for lazy random walk on Z'^. 

Lemma 2.2. For each c > 1, we have that 
n^oo hd{n) ^ 

Proof. Let gc{x,y; Zi) = ESo'" ^{x{t)=y} and observe 
gcix,y;Zi) < P,.[r(y) < cT^My,y;K)- 

We shall now prove a matching lower bound. Fix < c < c. Then we have 
that 

r / cT • \ 

/ mix \ 

9c{x,y;Zi) >^x \ ^ '^{X(t)=y} l{r(j/)<(c-c)r,nix} 

(2.1) >/„,e-?(x,y)(7H(y,y;Z^). 

Assuming c — c > 1, by mixing considerations (see Proposition 3.1) we have 
that 

(2.2) /„,c-c(x, y) = fn,cix^ y) + 0{Zn^-''). 
We also have 

(2.3) gz{y, y; Z^) = g,{y, y; Z^) - J^p^-'^(y, z; Zi)g,_~,{z, y; Z^) 

z 

= g,iy,y;Zi) + 0{n^-^) 
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since c > 0. Combining (2.1), (2.2), and (2.3), we have thus proved the lower 
bound 



gc{x, y; K) > /n,c(x, y)gciy, y, K) + 0((c + g{x, y- Z^))n^-'='). 

Theorem 1.5.4 of [ ,] imphes gc{x,y] Z^) = Q{\x — y\'^"'^) (it is actuahy stated 
for walks on Z'^ which are not lazy, but the generahzation is straightforward) . 
Consequently, 



e(n) if d = 3, 
0(logn) if d = 4, and 
^G(l) if d > 5. 



Hence, 



^(/n,c(x, y)gc{v, y, - 9c{x, y; Zi)f 
x,y 

^ 0{ic + g{x, y; Zt)^-^^ = 0{{Z^ + o{l))n^). 



x,y 



Dividing both sides by hd{n), taking a hmsup as n — )• oo, then as c — )• 
yields 

^™ y"(/n,c(x, y)gciy, y; K) - 9c{x, y; Z^)f = 0. 



x,y 



By (2.3) we know that \gc{y,y; Zf^) — gi{y,y; Z^)| = o(l) and, by transience, 
it is not hard to see that lim„_5.oo gi{y, y, Z^) = G(0; Z*^). □ 

Proof of Proposition 2.1. Letting = cTmix^"'^, Lemma 2.2 im- 
plies that it suffices to prove the existence of the limit 



(2.4) 



^iP 7r^y](9c(a;,y;Z^) -fif^ 



x,y 



We will divide the proof into the cases d > 4 and d = 3. 
Case 1: d > 4. As g^^^ = 0{n'^~'^), we have 

Y,^fn,c + 25„,c5(a^, y, Zi)) = o(l). 

x,y 
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Thus it suffices to sfiow in tliis case tliat lim„_i>oo ^(n) X^j^ (7c(0) 2/5 ^n) ex- 
ists, wfiere hi{n) = logn and h^in) = 1 for d > 5. Tliis will be a consequence 
of two observations. First, as 

\y\>i \y\>e m=l 




o(ri) if d> 5, 

0(log(n/^)) if d = 4, 



it suffices to show that, for each e > and for I = i{n,e) = ^, the 
limit linie^o linirn-oo /irf ^(?^) X^|y|<^ ^clO) y; ^n) exists (we can even restrict 
to finite £ if d> 5). Our second observation is that ^(O, y; ZJ^) — G{y; Z"') = 
0(n2-'^) for 1 2/1 < £. This follows since we can couple the walks on ZJ^ and 
Ti'^ starting at such that they are the same until the first time tq they 
have reached distance n/2 from 0, then move independently thereafter. The 
expected number of visits each walk makes to after time tq is easily seen 
to be 0{n?~'^), which proves our claim. Thus, 

\y\<i 

Therefore if c? > 5 we have 
For d = 4, 

1^"^ ^Eff'(^,2/;Z^) = lim E G'2(y;Z4). 

n^oo riAin) ^ — ' n^oo log n ^ — ' 

^''J^ y6Z4,|j/|<n 

Note that the limit on the right side exists since by Theorem 1.5.4 of [ ] 
(generalized to lazy walks), G{y;Z'^) = ad\y\'^~'^ + o(|y|~") if a G (0,(i) is 
fixed. 



Case 2: d = 3. The thrust of the previous argument was that random walk 
on Z^ for d > 4 is sufficiently transient so that pairs of points of distance 
r2(n^~^) make a negligible contribution to the variance, which in turn allowed 
us to make an accurate comparison between the Green's function for random 
walk on Z'i with that on Z'^. The situation for d = 3 is more delicate since 
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the opposite is true: pairs of distance 0{n^^'^) do not measurably affect the 
variance. 

Theorem 1.2.1 of [9] (extended to the case of lazy random walk, see also 
Corollary 22.3 of [1]) implies the existence of constants /33,73 > such that 
with p*{x, y; Z^) = ^§j2 exp ^ -^g have the estimate 

\p\x,y;Z^) -p\x,y;Z^)\ = \x - y\-^0{t-^/^). 

Hence letting p*(a;,y;Z^) = Ylkez'-^P^i^^V + kn;Z^), one can easily show 
that with 

rT ■ 
* mix 

A{x,y)= \p\x,y;Zl)-p\x,y;Zl)\ 

we have that 

(2.5) j^)E^'(-,y) = om. 

By differentiating p in t, we see that for l<i<s<t + l, we have 

|2^ 



ir (0, y, Zl) - p*(0, y; Z^)] = O Q + p*(0, y; 



We are now going to prove that 

(2.6) Y.\\ P*(0,y;Z3)-pW(o,y;Z3)a!f =0(1). 

It suffices to bound 

rT ■ -X \ 2 

mix 1 \ 



.6Z3 V'^i 



For we apply Cauchy-Schwarz to the integral and invoke the integrability 
of Xjt^ over [l,oo) to arrive at 

^\^,y-Zltdt = 0(X). 
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For B, we insert the formula for p into the integral, make the substitution 
u = t/\y\'^, and then compute to see 



This proves (2.6). Recall that is the three-dimensional continuum torus. 
For x,y G T^, let 

(2.7) gc{x,y;T^)= p\nx,ny;Zl)dt = - p''{x,y;T^)du, 

Jo n Jq 

where T = T^^Jr?. By (2.5), (2.6), we have that 

(5e(x,y;Z3)-5,(x/n,y/n;T3))2 =0(1). 

Therefore we may replace gc{x,y;Z^) in (2.4) with gc{x/n,y/n;T^). Note 
that (7c(t;T3) is the product of and the Green's function for 
where Bt is a Brownian motion on ; roughly, the reason that the Brownian 
motion moves at 1/2-speed is that lazy random walk moves at 1/2 the speed 
of simple random walk. It is left to bound 



/ / {gc{ [nx\ /n, [ny\ /n; T^) - g,{x, y; T')Ydxdy- 

the reason for the pre- factor is that we need to multiply by (n^)^ in 
order to make the double integral comparable to the double summation 
and we also divide by the normalization /i3(n). From (2.7), we see that 
gc{x,y;T^) is 0(n~^)-Lipschitz away from the diagonal = {{x,y) G 
X : |x — yl < e}. Thus since \{x,y) — {[nx\/n, [ny\/n)\ = 0{n~^), 
the integrand is 0(n~^) on D^, hence the integral over is 0(n~^). Since 
both ngc{ \nx\ /n, \ ny\/n; T^) and ng^x, y; T^) are uniformly L^-integrable 
over x T^, it follows that the contribution coming from Dg: can be made 
uniformly small in n by first fixing e > small enough. □ 



We now deduce Theorem 1.1 from Theorem 1.3. 



Proof of Theorem 1.1. Suppose G„ = for d > 3. Then rmix(Z^) = 
Q{n?) (see [lU]) and there exists > so that g{x,y; Z^) < Crf|a; — yp"*^ A 1 
(see [')]). Consequently, the hypotheses of Theorem 1.3 are obviously satis- 
fied, except for possibly (3). This is easy to see if x is sufficiently far from 
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y, z so that g[x^ y; Z^) + g{x, z\ ZJ^) < 1/2. If x is close to y, z then we can 
combine the Green's function estimate with the trivial bound that X start- 
ing at X with distance r to y,z will get to distance r + s without hitting y, z 
with probability at least (4d)~''. 

Proposition 2.1 implies that Fn^c ~ jCK^^c^dl'^) as n — )• oo. This is enough 
to dominate Tj^;j,(Z5^) = 0(n^) except if d = 3. We shall now argue that 
nevertheless, Fn,c is still the dominant term in this case. Note that 



fn,c ^ y'' - 



n 
y 

for some > and c > 2 fixed. Also, the transience of random walk on Z,^ 
implies that there exists > so that fnfi{x,y) > Ai\x — y\~^ A 1. Thus 
for 

'"-^'-(210^)""^^" 
we have that fn,c{x, y) — fnc^ 4^1^ ~ ^1^^ ^ ^- Consequently, 

Fn,c>^ E l^-yr'Ai = c-ie(n^). 

\x—y\<A2n 

A matching upper bound, up to a multiplicative factor, is also not difficult 
to see. 

Our lower bound for Fn^c depends on c by a multiplicative factor of 1/c 
while the second term in (1.5) decays exponentially in c. Thus by taking 
c > 2 large enough we see that Fn^c is still dominant for d = 3. □ 

We now turn to the proof of Corollary 1.4. 

Proof of Corollary 1.4 for the Hypercube. For Zg, it is easier 
to work with the continuous time random walk (CTRW) since the types of 
estimates we require easily translate over to the corresponding lazy walk. 
The transition kernel of the CTRW is 

p\x,y; Z^) = ^{1 + e-2*/n)n-|x-,|(i _ ^-2t/n-^\x-y\ ^ 

where |x — y| is the number of coordinates in which x and y differ. The spec- 
tral gap is 1/n (see Example 12.15 of [10]) which implies r2( 

^) — -^mix(Z2) — 

O(n^) (see Theorem 12.3 of [ ]). Consequently, the first hypothesis of The- 
orem 1.3 holds. If |x — y| = r, then it is easy to see there exists Ce, Pe > 
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so that 

Z^) < 



{C^tjnY exp[-(t/C7,n)(n - r)\ if t < en, 



provided e > is sufficiently small. Thus it is not difficult to see that 
g{x,y;Z'!^) < C'^n"''. Trivially, 

|{yGZMx-y|=r}| = H<n^ 



Thus 

Y^g\x,y;Z-,)<o(±n~^^-.n^=o(l), 

y^x \r=l J ^ ^ 

SO the second hypothesis of Theorem 1.3 is satisfied. The final hypoth- 
esis is obviously also satisfied. Now, a union bound implies that fnc — 
0(2-"rmix(Z^)), which implies (/„,c(x, x) - f^J^ = 1 + o(l). On the other 
hand, 

/„%(x,y) = 0(2"f]n-2-.n'-] =o(2"). 

\x-y\>l \ r=l / 

Putting everything together. Theorem 1.3 implies 

Var(|A|) = ^(l + o(l))2^ 

□ 



Proof of Corollary 1.4 for the Caley graph of Sn- Let G„ be 
the Caley graph of Sn generated by transpositions. By work of Diaconis 
and Shashahani [ ], the total variation mixing time of Sn is B(nlogn), 
which by Theorem 12.3 of [10] implies T^aixiGn) = 0(n(logn)(logn!)) = 
0(n^(logn)^). We are now going to give a crude estimate of p^{a,T] Sn)- 
By applying an automorphism, we may assume without loss of generality 
that a = id. Suppose that d(id, t) = r and that ri, . . . , are transpositions 
such that Tr • • • Ti = T. Then ti, . . . ,Tr move at most 2r of the n elements of 
{1, . . . , n}, say, ki, . . . , k2r- Suppose k'^, . . . , k'2^ are distinct from /ci, . . . , k2r 
and a G S'„ is such that a{ki) = k[ for 1 < i < r. Then the automorphism of 
Gn induced by conjugation by a satisfies ara~^ 7^ r. Therefore the size of 
the set of elements r' in Sn such that there exists a graph automorphism 99 of 
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Gn satisfying (/j(r) = r' and (^(id) = id is at least ( 2^ ) ^ 2-2^n2^((2r)n-i 



assuming n> Ar. Therefore 



(2.8) p\e,T-Gn) < -A^ and5(e,r;G0 < C{2''^ {2r)\) [log nfn 



This bound is good enough for r > 2 but does not quite suffice when r = 1. 
This case is not difficult to handle, however, since it is easy to see that the 
random walk has distance 3 from e with probability 1 — 0(l/n) after its 
first three moves, hence with distance at least 2 from any permutation with 
distance 1 from e. Combining this with (2.8) gives the desired bound. From 
this is it clear that {Gn) satisfies the hypotheses of Theorem 1.3 and, arguing 
as in the case of the hypercube, that 

Var(|A|) = ^(l + o(l))n!. 

□ 

3. Preliminaries. 

3.1. Notation. For x,y £ Vn, let Ti{x,y) = Ti{x) A Tj(y) and T{x,y) = 
Ti{x,y) AT2{x,y). Let H{x,y) = {ri(x) < Ti{y),T2{x,y)}, 

7r{z;x,y) = P[X2(ri(x, y)) = z\H{x,y)], 

and let vr be the uniform measure on Vn- Throughout, Pz[-] denotes the law 
of random walk initialized at z (and the initial distribution is stationary 
whenever z is omitted). We will often not emphasize the dependence of 
various quantities on G„ since it will be clear from the context. Let 

r„ = CoTmixlog IKI, '^" = 7777' '^n = T„log|Ki|, 5„ = ^ 5^ (x, y) 

where cq will be determined later. The proofs in this article will involve 
probabilities of complicated events. To keep the formulas succinct, it will be 
helpful for us to introduce the following notation: let 

Gij{x) = {Ti{x) < Tj{x)}, 

Gij{x,y) = {Ti{x,y) < Tj{x,y)}, and 
Gi{x,y) = {Ti{x) < Ti{y)}. 
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3.2. Elementary Estimates. Recall that the total variation distance of 
measures fj,, u on V is 

\\^ - u\\tv = max|/x(A) - u{A)\ = \ IKlx}) - i'{{x})\. 

The following provides a bound on the rate of decay of the distance oip^{x, •) 
to stationarity. 

Proposition 3.1. For every s,t G N, 
(3.1) max •) — ttWtv < 4 max •) — vr||rv'||p^(y, •) — tt\\tv 



(3.2) max 



x,y 

p*+*(x, y) 



7r(y) 



1 



. P%x,y) t 

< max — max \\p [x, ■) — tt\\tv- 

" y iT[y) X 



Proof. The first part is a standard result; see, for example, Lemmas 
4.11 and 4.12 of ["']. The second part is a consequence of the semigroup 
property: 

^p*+"(x,z) = ^ Vp*(x,y)p"(y,z) 
■k[z) 7r(z) ^ 

= -rT Y\[p\x, y) - 7r(y) + 7r(y)]p''(y, z) 

\ y y 

( p'^iy, z)\ f, 

< max -— \\p (x, •) - vr TV + 1- 

V y,z tt[z) I 



□ 



Note that (3.1) and (3.2) give 

/q Q\ P\x,y) 
(3.3) max 

x,y 



< 76"^° for t > aT„ 



where 7 > is a universal constant. We will often use (3.3) without reference. 

Throughout the article, it will be important for us to have precise esti- 
mates of the Radon-Nikodym derivative of the law of random walk condi- 
tioned on various events with respect to the uniform measure. In the follow- 
ing, we are interested in the case of a random walk conditioned not to have 
hit a particular point. Let = /cTmix- 

Lemma 3.2. There exists 7,^*0 > so that for all k > 1 satisfying 
< Pq and c>2 we have 

Px[X{cTk) = z\T{y) > cTk] = [1 + 0(e-^^^ + kTn + g{y, z))]7t{z). 
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Note that by part (1) of Assumption 1.2, this lemma apphes ii k = 
0((log|K|)2). 

Proof. Using that P^[X{cTk) = z] = (l + 0(e-^^'=))7r (z), an appHcation 
of Bayes' formula yields 

P^[X{cTk) = z\T{y)>cTk] 
^ P..[r(,)^cT.|X(cT.) = .] ^ ^ 
Px[r(y) > cTk] 

The idea of the rest of the proof is to show that it is unlikely that X hits y 
close to time cT^, in which case we can use a mixing argument to show that 
conditioning on X{cTk) = z has little effect. For 1 < c < c + 1 < c, we have 

Px[T{y)>cn\X{cTk) = z] 
=P,[t(2/) > in^k\X{cTk) = z]- P,[cTfc > r(y) > 2rfc|X(cTfc) = z]. 

By time-reversal, a union bound, and mixing considerations, it is easy to see 
that the second term on the right hand side is bounded by 

P.[T(y) < ZTk\X{cTk) = x]= 0{g{y, z) + kTn). 

Applying Bayes' formula, observe 

p.[r(y) > mx{cn) = z] = p^[x{cn) = z] - 

= (l + 0(e-'=^(^-^))P,,[r(y)>2Tfc]. 

Similarly, 

Px[r{y) > cTk] = Px[r{y) > ^k] - Px[cTk > r{y) > ET^], 

where the second term on the right hand side is of order 0(/cT„). We now 
take c = c/2 and 7 = 7/2. By part (3) of Assumption 1.2, we have that 

Px[T{y) > kT^i,] >l-po- 0{kTn) 

uniformly in n. In particular, there exists po > so that if A;T„ < pQ then 
Px[T{y) ^ ^^mix] + 0{kTn) is Uniformly positive in n. Putting everything 
together, for such k we thus have 

P^[X{cTk) = z\T{y)>cTk] 

(1 + 0(e-^^fe))P.[T(jy) > STfe] + 0{g{y, z) + fcT„) . . 

= p r ^ A ^ 1 I n^^-T- ^ (1 + 0(6 ^ ))7r(z) 

= (1 + ©(e-^^'^ + <7(?/, ^) + A:T„))7r(z), 
as desired. □ 
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In the following lemma, we will show that the difference in the proba- 
bility that a random walk hits points y, z when started from x before time 
Tn = co(logn)Tjnix is essentially determined by the corresponding difference 
except up to time cTmix- The reason for the cancellation is that the previous 
lemma implies that conditional on not hitting a given point up to time cTmix, 
the walk is well mixed and has long forgotten its starting point. Recall that 
/c(x,?/) = Px[r(y) < cTmix]- 

Lemma 3.3. There exists 7 > such that for all c> 2, 

Px[r{y) < r„] - P,[t(z) < r„] = f,{x, y) - Ux, z)+ 
0(e-^^T„ + A„[<7(x,y)+5(x,z)]). 

Proof. We observe, 

Px[r(y) <r„] = /e(x,y)+ 

^PxicTk < T{y) < cTk+i\T{y) > cTfc](l - P^[T{y) < cTk]), 

k 

where here and throughout the rest of this proof the summation over k is 
from 1 to ^ log \ Vn\- As the previous lemma is applicable for such choices of 
k, we thus have 

Px[cTk <T{y)< cTk+i\T{y) > cTk] 
=P^[cTfc < r(y) < cTk+i\T{y) > cTk] + 

O (^^g{y,w){e-^'' + kTn + g{y,w))7r{w)^ . 
Performing the summation, we see that the latter term is of order 

(3.4) o{rr,e-^'^'' + krl + Sn\Vn\-'). 

Recall from part (2) of Assumption 1.2 that 5„ = o(Tmix/ log hence 
(log = o(T„). Consequently, summing (3.4) over k from 1 to 

^ log \Vn\ gives an error of 0{Tne~^'^ + A^). By part (1) of Assumption 1.2 
it is clear that = o(T„), hence the former is of order 0{Tne~^'^). This 
leaves 

'^P^[cTk < T{y) < cTk+i\T{y) > cTk\P^[T{y) < cTk] 
k 

=0 < cr„,ix]vr(z)(5(x,y) + /cT„)^ . 
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Here, we used the bound Pa;[r(y) < cT^] = 0{g{x,y) + kTn) along with 
the previous lemma to get the crude estimate Px[X{cTk) = z|r(y) > cT^] < 
Ctt{z) for some C > 0. Summing everything up gives us an error of order 
0{/S.ng{x, y) + A^). We also have another contribution of 0{/S.ng{x, 2;) + A^) 
coming from the corresponding estimate of 'Px[t{z) < r„]. Therefore our 
total error is 0{Tne~'^'^ + An[g{x, y)+g{x, z)]), which proves the lemma. □ 

4. The Radon-Nikodym Derivative. Let 7r{z; x, y) = 'F[X2{ti{x, y)) ■ 
z\}i{x^y)\. The purpose of this section is to prove the following estimate 
of the Radon-Nikodym derivative of TT{z;x,y) with respect to n{z). Recall 
fc{x,y) = Px[T{y) < cTmix] and = J2y fc{x,y)TT{yy, we are omitting the 
dependence on n to keep the notation simple. 

Theorem 4.1. There exists a constant 7 > so that for all c > 2 we 
have 

^^^^ = 1 + (1 + 0(A„)) (27, - z) - Uy, z) 
+ Oie-^'^Tn) + 0{[g{x, z) + g{y, z) + A„] [g{x, y) + A„]). 
In particular, 

(4.1) ^^^^^ = 1 + 0{9{x. y) + 9{y, z) + g{x, z)). 

Let Y2 = X2{ti{x, y)). The idea of the proof is to observe that 

nH{x,y)\Y2 = z]T,{z) 



7r(2;; x, y) = P[y2 = z\H{x, y)] 



nH[x,y)] 



since Xi , X2 are independent and the initial distribution of X2 is stationary, 
then estimate the effect of conditioning on Y2 = z on the probability of 
H{x, y). We will divide the proof into three lemmas. Let 

^{x,y) = {T2{x,y) > Ti{x,y) - r„,Gi(x,y)} \ H{x,y) 
= {Ti{x,y) > T2{x,y) > Ti{x,y) - r„,Gi(x,y)}. 



Lemma 4.2. Uniformly in x, y, z, n, 

^z-x y) ^ P[A{x,y)]- P[A{x y)\Y2 = z] ^ o(|^^|-ioO)_ 
Tr{z) P[H{x,y)\ 
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Fig 1. In Theorem 4-1, we give a precise estimate of the Radon-Nikodym derivative of the 
law of Y2 — X2{Ti{x,y)) with respect to the uniform measure on V„ conditional on the 
event that H{x,y) = {fiix) = ri{x,y) A T2{x,y)}, i.e. that the first point m {x,y} hit by 
Xi , X2 is X by X\ . The open circles indicate the starting points of X\ , X2 and the shaded 
circle is Y2. 

Proof. Letting R{x,y) = {T2{x,y) > Ti{x,y) — Tn,Gi{x,y)}, observe 
P[H{x,y)\Y2 = z]= P[R{x,y)\Y2 = z] - P[A{x,y)\Y2 = z]. 

We will now manipulate the first term on the right hand side. Let R{x, y) = 
Gi{x,y) \R{x,y). We have, 

(4.3) P[R{x,y)\Y2 = z] = P[Gi{x,y)\Y2 = z] - P[R{x,y)\Y2 = z] 
and, since Y2 ~ vr, Bayes' rule implies 

P[R{x,y)\Y2 = z] = -^,P[Y2 = z\R{x,y)]P[Rix,y)]. 

Since the conditional probability on the right hand side involves conditioning 
on the behavior of X2 before ti{x, y) — r„, mixing considerations imply that 
this is equal to 

(4.4) [l + 0(|Kr^^°)]Pfey)] =P[i?(x,y)] + 0(|K|-^^o). 

As P[Gi{x,y)\Y2 = z\ = P[Gi(a;,y)], combining (4.3) with (4.4) we thus 
have 

P[H{x,y)\Y2 = z] =P[R{x,y)] - P[A{x,y)\Y2 = + 0(|K|-^^°) 

=P[H{x,y)]+P[A{x,y)] - P[A{x,y)\Y2 = z] + OdKl'^'^''). 
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Fig 2. Illustration of the event A(x,y). The solid lines are used to indicates the parts 
of Xi,X2 up to the time T2{x,y), while the dashed line is used for the part of Xi after 

T2{x,y). We have not indicated the part of X2 after T2{x , y) , Note that we have X2 hitting 
y first, hut A{x, y) allows for X2 to hit x first as well. On the other hand, A(x, y) requires 
that X\ does in fact hit x before y. 



Assume that 7C0 > 100. Putting everything together, we see that 



^(z; X, y) _ V[H{x, y)\ + P[A(x, y)] - P[A(x, y)\Y2 = z] , ,_ioo, 



<z) P[^(x,y)] 
uniformly in x, y, z, n. □ 

Note that if F„ = for d > 3 then P[Gi(x, y), Gi2(x, y)] = P[H{x,y)] 
does not change when x is swapped with y nor when 1 is swapped with 2 and, 
moreover, is equal to | up to negligible error. This holds more generally if 
for every x,y £ Vn distinct there exists an automorphism of Gn such that 
(/^(x) = y and (p{y) = x. The weaker hypothesis of vertex transitivity implies 
that we can always find an automorphism of Gn such that ^p(x) = y but 
not necessarily so that ip{y) = x as well. Nevertheless, it is still true in this 
case that P[H{x,y)] \. 

Lemma 4.3. We have that 

Proof. Let A{x, y) = {ri(x, y) > T2(.t, y) > ti{x, y)-riv} and fi{z-, x, y) = 
P[Y2 = z\Ti{x,y) < T2{x,y)]. Using exactly the same proof as the previous 
lemma, we have 



Tr{z) 



= 1 + 0(P[A(x, y)] + P[A{x, y)\Y2 = z] + 
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Consequently, using the trivial bound 

P[A{x, y)\Y2 = z]= 0(A„ + g{x, z) + g{y, z)) 
(and similarly for P[A(x,y)]), we have that 

XI z)n{z; X, y) = Tn + X g{x, z)0 (A„ + g{x, z) + g{y, z)) 
= T„ + -^0(5„ + A„rrnix). 

I 

By parts (1) and (2) of Assumption 1.2, we have that 5„ + A^Tmix 
o(Tinix/(log |V"„|)). Consequently, the above is equal to 



(4.5) ^" + ^1 IT/ I 

Let Pa;' = P[G'i(x,y),Gi2(x,y)] and = P[Gi(y, x), Gi2(x, y)]. Note that 



(4.6) 



p,+Pj, = P[Gi2(x,y)] = ^ + o(^^^ 



since P[ri(z) = T2(tL')] < P[X2(ri(z)) = = ^ for any z,w £ Vn- 

Define stopping times as follows. Let 

Ti = min{t > 1 : Xi{t) £ {x,y} or X2(t) G {x,y}} = ri(x,y). 

For J > 1, inductively set 

Tj+i = mm{t > Tj + Tinix + 1 : Xi{t) £ {x, y} or X2(t) G {x, y}}. 

Let Tj,, = EtitJ""'" l{x(t)=.} and, for E C K, set Aij{E) = {Xi{Tj) G S}. 
Note that the average amount of time spent at x by Xi through time Tfc+Tmix 
is given by the expression 

^ + rp^.^ Yj ('^Ai,(x)^A-^ix,y)Tj,x + lA,,{y)^A-^{x,y)Tj,x + '^A2j(x,y)TjA- 

i=i 

It is not difficult to see that the above quantity converges to 7r(x) ask ^ oo. 
We can also define a similar quantity but replacing x with y which will 
converge to 7r(y) as fc — )• oo. Taking the ratio of these two quantities, we 
arrive at 



1 v^fc 

k 

1 = lim 

fc— >oo 



Y!j=l ^lAi,(x)lyl=^.{x,?/)77> + lAij{y)lA§^.(x,|/)77> + '^A2j{x,y)Tj,^ 
1 Ej = l (^lAij(x)lA^^.(a;,2/)7^-,i/ + '^A^,{y)'^Al^{x-y)Tj,y + 1^2, (x,?;)7?\!/ ^ 
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since 7r(x) = 7r(y). It is not difficult to see that, almost surely, 
1 

ii,™ k ^A,,{x)^Al^{x,y)Tj,x = PxQix, X), 

i=i 

1 ^ 

°° i=i 
1 

r -"-^2^(^.2/) "^'^ = fey Vfi'(2;,x)/i(z;X,7/), 

where qxy = 1 — Px — Py Analogous formulae hold for the terms in the 
denominator. Combining this with (4.5), we thus have 

^ ^ Pxgjx, x) + Pygjy, x) + g^y Y.z s{z, x)n{z; x, y) 
Py9{y, y) + Pxg{x, y) + qxy Y.Z 9{z, y)l^{z; X, y) 

^ Pxgjx, X) + Pygjy, x) + QxyTn ^^f Tn \ 
Pygjy, y) + Pxgjx, y) + qxy^n \\og\Vn\) ' 

Rearranging and using that gjx,y) = g{y,x) and g{x,x) = g{y,y), this 
implies that 

Combining this with (4.6) proves the lemma. □ 

In order to complete the proof of Theorem 4.1 we need to estimate 
P[j4(x, ?/)|y2 = z], which is this purpose of the following lemma. Though 
the proof will be computationally intensive, the basic idea is fairly simple. 
The main goal is to eliminate the conditioning on I2 = -Z- The first step 
is to perform a time reversal, which converts the terminal condition to an 
initial condition at the cost of making the event whose probability we are 
to compute a bit more complicated. The latter is easily mitigated, however, 
since the event can be greatly simplified at the cost of negligible error. 

Lemma 4.4. There exists 7 > so that for all c>2 we have 

P[A{x, y)\Y2 = z] = Q + 0(A„)^ [Ux, z) + Uy, z)] + E,{x, y)+ 

0(e-^^T„ + [g{x, z) + g{y, z) + A„] [g{x, y) + A„]) , 
where Ec{x,y) is some constant which does not depend on z. 
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Note that the lemma imphes 
P[A{x, y)] - P[A{x, y)\Y2 = z] = 0{g{x, y) + g(y, z) + g{x, z) + e-^^T„). 
Proof. Let 

B{x,y) = {X2{t) i for ah t G (r„, ti(x, y)], Gi(x, y)} 

and let Pt,^z be the law of {Xx.X'i) where Xi(0) ~ vr and ^2(0) = z. We 
compute, 

P[A(x, y)\Y2 = z\ = ^P[^(x, y),Y2 = z] 
= ^P7r,«,[n(a;,y) > T2(x,y) > Ti{x,y) - r„,Gi(x,y),y2 = A 

w 

By reversing the time of X2, we see that this is equal to 

^P-K,z['^2{x,y) < Fn /\Ti{x,y),B{x,y),Y2 = w] 

w 

(4.7) =P^,,[r2(x,y) < r„ Ari(x,y),5(x,y)]. 

We will now work towards simplifying the event, at the cost of some error. 
We begin by eliminating the "minimum" operation using the observation 
that it is unlikely that both Xi,X2 hit quickly. Indeed, as 

P7r,2[-ri(a;,y) < r„,T2(x,y) < r„] = 0{[g{x,z) + g{y,z) + A,„]A,„) 

we see by setting B{x,y) = B{x,y) n {ri(x,y) > r„} that (4.7) is equal to 

FnAMx, y) < r„, B{x, y)] + 0{[g{x, z) + g{y, z) + A„]A„). 

We would now like to eliminate the dependence of the probability on z, the 
starting point of X2. We accomplish this by considering two possible cases. 
Either X2 hits x or y within some multiple of the mixing time or it does 
not and, conditional on the latter, the walk will have mixed, so the relevant 
probability does not depend on z. We implement this strategy as follows: 

PttAM^^v) <r„,5(x,y)] =Pn,z[Mx,y) < cTrab„B{x,y)] + 

P7r,z[T2(x,y) < Tn,B{x,y)\T2{x,y) > cTmixKl - Pz[T2ix,y) < cTmix])- 

Using the same proof as Lemma 3.2 except in the case that random walk is 
conditioned not to hit two points rather than just one implies //(w; x, y, z) = 
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Pz[-^2(cTmix) = w\T2{x,y) > cT^ix] < C-k{w) for some constant C > 0. 
Consequently, 

PnAM^^y) < r„,B(x,y)|r2(x,y) > crmix]P^[T2(x, y) < cTmix] 
<CP^[r2(x,y) < r„]P,[r2(x,y) < cT^i^] = 0{[g{x, z) + g{y, z)]An). 

We are left with two terms to estimate: 

(4.8) Pn,z[T'2{x,y) < cTraix,B{x,y)], 

(4.9) Pn,z[T'2{x,y) < Tn,B{x,y)\T2{x,y) > cTmix]- 

We will first deal with (4.8) which, using the independence of Ti{x,y) and 
T2{x, y), we can rewrite as 

PT,^z[B{x,y)\Ti{x,y) > r„,T2(x,y) < cTrai^jP ;,[t2{x , y) < crmix]P[Ti(a;, y) > r„] 

Since B{x,y) depends on X2{t) only for t > r„,, from mixing considerations 
it is easy to see that 

P-!r,z[B{x,y)\Ti{x,y) > Tn,T2{x,y) < cTmix] 

=P[B{x,y)\nix,y) > r„] + 0(1^1"'°°). 
Consequently, (4.8) is equal to 

P,[r2(x,y) < cT^ix]P[5(x,y)] +0(|Kri°°). 

Note that 

B{x, y) = {H{x, y) n {t{x, y) > T,}) U {B{x, y) D {t{x, y) < r„}). 
Using P[t{x, y) < r„] = 0(A„), the previous lemma thus implies 

P[B{x, y)] = P[H{x, y)] + 0(A„) = ^ + 0(A„). 

Observe 

Pz[T2(x),r2(y) < cTmix] = Pz[r2ix) < T2{y) < cTinix] +P2[T2(y) < T2{x) < cTmix] 

= 0{g{x, z)g{x, y) + g{y, z)g{x, y)). 

Consequently, 

Pz[t2{x, y) < cTmix] = fcix, z) + /c(y, z) - Pz[T2{x),T2{y) < cTmix] 

=fc{x,z) + fc{y, z) + 0{[g{x, z) + g{y, z)]g{x,y)). 
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The final part of the lemma follows since, arguing as in the proof of Lemma 
3.3, we can estimate (4.9) as follows: 



Taking Ec{x,y) = P[r2(x,y) < Fn, B{x,y)\T2{x,y) > cTmix] finishes the 



Combining the three lemmas immediately gives Theorem 4.1. 

5. The Variance. We will complete the proof of Theorem 1.3 in this 
section. The general theme is to eliminate asymmetry wherever possible. We 
first apply this idea by considering 



in place of |^i|. In addition to being symmetric in Xi,X2, note that B also 
differs from in that we have eliminated those sites whose marking is de- 
termined by the flip of a fair coin. These, however, do not make a significant 
contribution since it is a rare event that both walks hit a particular point 
for the first time simultaneously. In particular, we will show in Lemma 5.1 
that Var(^) w 4Var(|^i|), up to negligible error, so to prove Theorem 1.3 
it suffices to show that 



Fn,z[T2{x,y) < ^n,B{x,y)\T2{x,y) > cTmiJ 

■F[T2{x,y) < Tn,B{x,y)\T2ix,y) > cTmix] 
+ 0{e-^'Tn + ig{x,z)+giy,z))An). 



proof of the lemma. 



□ 




X 



X 



Var(fi) = 5^(/c(x,y) - + 0[e-^\T^^f). 



It is more convenient to work with B as 



Var(S) = 2^^ P[Gi2(x),Gi2(2/)] - P[Gi2(x),G2i(y)] 



= 4 J] P.,.[Gi2(y)] - P.,.[G2i(y)] ^(^; x, y)P[F(x, y)] 




Applying Lemma 4.3, we can rewrite the above as 




x,y,z 
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We will show at the end of this section that (5.2) is negligible. Note that 



PxAGijiy)] =Px,zh{y) < Tj{y) < r„] +P,,,[r,(y) < r„,r,-(y) > r„] 



In subsection 5.2, we break the sum in (5.1) into three different cases based 
on the time decomposition above and bound each in a given lemma. It will 
turn out that the error coming from the terms corresponding to A and C is 
negligible (Lemmas 5.3, 5.4). The reason for the former is that it is unlikely 
that both Xi,X2 hit y quickly and the latter follows as, conditional on 
having not hit y by time r„, both walks have long forgotten their initial 
conditions and are well mixed. This leaves B, which dominates the variance. 
Its asymptotics will be computed (Lemma 5.2) by reducing the estimate to 
a computation involving 7f(z;x,y), whose Radon-Nikodym derivative with 
respect to the uniform measure has already been estimated quite precisely 
in Theorem 4.1. 

5.1. Symmetrization. 

Lemma 5.1. We have 



Proof. Let {S,n{x) : x G Vn) he iid random variables independent of 
Xi,X2 with P[U{x) = 1] = P[Cn{x) = 2] = I and let A{x,i) = {ti{x) = 
T2ix),^nix) = i}- By definition. 



(5.3) 



+ P^,,[r„ < Ti{y) < Tj{y)] = A + B + C. 



Var(e) = 4Var(|^i|) + y^T~V^^{\A^ + T, 



mix 



as n —)• OO. 




Observe, 




By the strong Markov property and independence of Xi , X2 , the above is 
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bounded by twice 

Y^P.Aniy) = T2(y)]P[Ti(x) = T2ix)] 
x,y t 

Using that Px[r{y) = t] < Px[X{t) = y] and X2(ti(x)) ~ vr when X(0) ~ vr, 
we have the further bound 

-1 / 4Tniix 

(5-4) E E = + E = t])^ 

Summing the first term over x, y, t plainly yields 4Tmix- For the second term, 
note there exists C > so that for t > 4Tmix we have 

Px[r{y)=t]<P^[X{t)=y]< ^ 



hence 

^ {Px[r{y)=t]f<^ ^ PAr{y) = t]<^ 

Therefore the second term in the summation in (5.4) is 0(1). The lemma 
now follows from Cauchy-Schwarz. □ 

5.2. Conditioning. We begin by estimating the part of (5.1) correspond- 
ing to "S" from (5.3). 

Lemma 5.2. We have 
^ (PxA^iiy) < r„,T2(y) > r„] - PxAMy) < r„,ri(y) > r„] j7r(z;x,y) 

x,y,z ^ ^ 

(1 + 0(A„)) ^(/c(x, y) - + 0(e-^'=(T^ix)2). 
Proof. Note that 



PxA'^iiy) ^ r„i'^2(y) > r„] - PxAMy) < r„,Ti(?/) > r„] 

=Px[ri(y) < r„] - P,[T2{y) < r„]. 
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Let 6i{x, y, z) = 0(e '^"Tn + {g{x, z) + g{y, z) + A„)(g(x, y) + A„)) be the 
error term from Theorem 4.1 and 52(x,y,z) = 0{e~'^'^Tn + An{g{x,y) + 
g{x, z))) be the error from Lemma 3.3. Then we can rewrite the summation 
in the statement of the lemma as 

(Px[Ti(y) < Tn]-Pz[T2{y) < Tnm{z;x,y) 

x,y,z ^ ^ 

^ E ^ r„] -P,[T2(y) < Tn]{l + e{x,y,z))^ + 



x,y,z 



riTT X] ( ~ /c(y>^)l + h{x,y,zU6i{x,y,z) = Bi + B2 

where, by Theorem 4.1, 

e(x, y, z) = {l + 0(A„)) (2/, - /^(x, z) - z)). 

Applying Assumption 1.2 repeatedly, it is tedious but not difficult to see 
that B2 = O(e-'^^r^i^). By Lemma 3.3, 

i2f,-Mx,z)-My,z)). 

Multiplying through, using the symmetry of / in its arguments, and cancel- 
ing many terms, this becomes 

(1 + 0(A„)) ^(/.(x, y) - 7,)2 + 0(e-^^(r^ix)2). 
x,y 

□ 

We will now show that the part of (5.1) coming from "j4" of (5.3) is 
negligible. Roughly, the reason for this is that it is unlikely that both walks 
hit y quickly, though in order to get a sufficiently good bound we will need to 
take advantage of some of the cancellation inherent in the situation. This will 
require us to invoke (4.1), which is a rough estimate of the Radon- Nikodym 
derivative of vr with respect to vr. 

Lemma 5.3. Uniformly in n we have 
Y ( Pa;,^[-ri(y) < T2{y) < LJ -PxA^-iiy) < n{y) < Tn]jTT{z;x,y) 

x,y,z ^ ^ 

= o(r2jj 
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Proof. By (4.1), the summation in the statement of the lemma is equal 

to 



(Px,^[n(y) < r2{y) < T„] - P.AMy) < n{y) < r„]) 

' ' x,y,z 

il + Oigix,y)+giy,z)+g{x,z))). 
By symmetry, we see that this is equal to 

(Px,^[n(y) < r2{y) < r„] - P.A'^2iy) < n{y) < r„]) 

{0{g{x, y) + g{y, z) + g{x, z))). 

Using 

Px,.[ri(y) < T2{y) < r„] < Va^M{y) < r„,r2(y) < r„] 
and the independence of Xi,X2, we have the further bound 

Y 0{[g{x, y) + An)igiy, z) + A„))0(5(x, y) + g{y, z) + g{x, z)). 

By the symmetry of g in its arguments, we can rewrite this as 
fjTT Y ^^9{x, y)g^{y, z) + g'^{x, y)A„ + g{x, y)A.^ + g{x, y)g{x, z)An+ 

g{x,y)g{x,z)g{y,z)). 

Repeated applications of Assumption 1.2 and a tedious but easy calculation 
give the lemma. □ 

We complete this subsection by proving that "C" from (5.3) is also neg- 
ligible. The intuition for this is that by time r„ both walks are very well 
mixed hence given that both have not hit y, the difference in the probability 
that one hits before the other is of smaller order than any negative power 
of (though we chose to write —100). The proof will be in a slightly dif- 
ferent spirit than the previous lemmas since estimating the Radon- Nikodym 
derivative of the conditioned laws with respect to vr will not quite suffice. 

Lemma 5.4. We have 

PxA^n < ri{y) < T2{y)] - P.,,[r„ < T2(y) < n{y)] = 0(|Kr'°°). 
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Proof. The idea of the proof is to use a standard couphng argument 
to show that, conditional on {Ti{y) , T2{y) > r„}, the laws of Xi{Tn) and 
-^2(rn) have total variation distance 0(|14i|~^'^'') independent oix, z. To this 
end, we set n{z;x,y) = Px[X{Tn) = z\T{y) > r„]. Let Y{t) be the process 
given by X{t) conditioned on the event {t(?/) > Tn}- Then Y(t) is Markov 
(though time-inhomogeneous) as 

P[Y{t) = z\Y{l) = zi, . . . , y(t - 1) = zt-i] 
=P[X{t)=z\X{l) = zi,...,X{t-l) = zt^i,T{y)>Tn] 
^ P[X{t) = z,T(y)>rn\X{l) = zi,...,X{t-l) = zt-i] 
P[r(y) > r„|X(l) = zi,...,X{t-l) = zt-i] 

^ p,,_jx(i) = z,T(j/)>r„-(t-i)] 

p.,. Jr(y) > r„ - - 1)] 
depends only on z, zt-i. Recall that T^. = /cTmix- For t = cT^, note that 

Px[r(y) > r„,|r(y) > t] 

^ Pz[T{y) >Tn- t]Px[X{t) = z\T{y) > t] 

Px[T{y)>Tn\T{y)>t] 
Combining part (3) of Assumption 1.2 with Lemma 3.2, we have that 
Pz[r{y)>Tn-t] 



Pxlriy) > r„|r(2/) > t] 



9(1) for z^y. 



Also, since YlzdiU'^) ~ '^mix and T^ix = o(|Ki|), it follows that for each 
e > fixed, with A = {z : g{y, z) < e} we have |A|/| Vn| = 1 — o(l). Lemma 
3.2 also implies that Px[X{t) = z|r(y) > t] = 0(l)7r(z) on A uniformly in n 
large provided that provided k is large enough and e > is sufficiently small. 
This implies that we can couple together the laws Yu{cTi:),Yy{cTf^) starting 
at u,v distinct so that with probability po > we have Yu{cTk) = Ytj{cTk). 
If we iterate this procedure C2 = ^ log \ Vn\ times, tj = 7]{c, k, pq), we get that 
with probability 1 — 0(| V'ril"'^^) we have y„(r„) = yy(r„). Consequently, we 
may assume that cq is sufficiently large so that 

max||i/(-;r„,7x) - i/(-, ; r„, = 0(|K|-^°°). 

u,v 

Let 17 be a measure so that max^j ||z^(-; r„, u) — X^Hj^y = 0(| V^l"^^*^). Let 
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D = {Ti(y), T2(y) > r„}. Then we have that 

< niy) < T2(y)] - Px < My) < Ti(y)] 

= Y,('PuAGi2{y)]-Pu,vP[G2i{y)])Hu;Tn,xp{v;rn,z)PxjD] + 

u,v ^ ^ 

=0(|K|-^°°). 

□ 

Proof of Theorem 1.3. To finish the proof of Theorem 1.3 we just 
need to take care of the term in (5.2). The previous lemma imphes 

^|Px,.[Gi2(y)]-P.,.[G2i(y)]| 

x,y,z 

= ^ |P.,.[Gi2(y),ri(y) < r„] - P:.,.[G2i(y),r2(y) < Tn]\ + 0{\Vn\-''^). 

x,y,z 

Observe {Gijiy),n{y) < r„} C {ri{y),Tj{y) < T^} U My) < r„ < Tj{y)}. 
Thus we can bound from above the previous expression by 

^ f2Px,z[n{y),T2iy) < r„] + iPAniy) < r„] -P,[r2(y) < r„]|j = + ^2- 

x,y,z ^ ^ 

The term corresponding to Ei can be bounded in a similar manner as 

in the proof of Lemma 5.3. Indeed, by the independence of Xi,X2, we have 

that 

Px,z[n{y),T2{y) < r„] < {g{x,y) + An){g{y,z) + An) 

which, when summed over x, y, z, is of order 0((log jl^D^l^nlT'mix)- '^^^ 
estimate E2 using techniques similar to the proof of Lemma 5.2 since by 
Lemma 3.3, 

\Px[Ti{y)<Tn]-Pz[T2{y)<rn]\=0{g{x,y) + g{y,z)+62{x,y,z)), 

where, as in the proof of Lemma 5.2, 52{x,y,z) corresponds to the error 
from Lemma 3.3. When summed over x,y,z, this is of order 0(| KipTinix). 
Therefore 



^ ^ + o(— 

\Vn\nog\Vn\J \\Vn 

as desired. □ 



32 JASON MILLER 




(a) Zfoo (b) Zta (c) T.f 

Fig 3. Q-Q plots based on 20,000 simulations of the number of sites visited by X\ before X2 
against an appropriately fitted normal distribution, supporting the conjecture of asymptotic 
normality. 



6. Further Questions. 

1. The first step in proving a sequence of random variables {Xn) has a 
Gaussian limit after appropriate normalization is the determination 
of the asymptotic mean and variance. We remarked in the beginning 
that, in our case, the expected number of sites colored blue is |Ki|/2 
and Theorem 1.3 gives the limiting variance. Figure 3 shows Q-Q plots 
of the empirical distribution of the number of sites painted 1 in the 
final coloring against an appropriately fitted normal for three different 
base graphs. Based on these plots, we conjecture that 

I '^i I E I I 

VVardAI) 

has a normal limit for all graphs satisfying Assumption 1.2. 

2. Our derivation of the variance ignores the time aspect of the problem 
in the sense that it gives no indication of at what point in the process 
of coverage the variance is "created." Does it come in bursts or con- 
tinuously? Does it come sooner than any multiple of the cover time 
or perhaps in [eTcov , 7"cov] ? More generally, when normalized appro- 
priately, does the the process t 1— )• l|^^(2,)<^2(z)<t} have a scaling 
limit? 

3. We make repeated used of the symmetry afforded by the fact that 
we consider two random walks moving at the same speed on vertex 
transitive graph. It would be interesting to see if a similar result holds 
when the various degrees of symmetry are broken. Starting points 
for exploring this problem include considering continuous time walks 
moving at various speeds, multiple walks, and graphs which are not 
vertex transitive. 
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4. Theorem 1.1 only holds for tori of dimension d > 3 as the case d = 2 
falls just outside of the scope of Theorem 1.3. It would be interesting 
to see a more refined analysis carried out to handle this case. 

5. That the variance computed in Theorem 1.1 for d = 3, 4 is significantly 
larger than in the iid case suggests that the clusters which have an 
unusually large number of sites painted a given color are either larger 
or more dense than in an iid marking. How large and frequent are such 
clusters? What is their geometric structure? 

6. Another interesting quantity is the size B of the boundary separating 
the sites painted 1 and 2, as studied in [ ]. It is not difficult to see that 
there exists a constant f3d > such that E|i3| ~ f^du'^ when d > 3 as 
n — )• oo. Indeed, this follows since the probability that {Ti(y) < T2{y)} 
for y ~ X given {ti{x) < T2{x)} converges to a limit pd £ (0, 1). Note 
that this is of the same order of magnitude as E|^i | . Is it also true that 
Var(|;S|) = G(Var(|^i|)) or do these quantities differ significantly? 
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